ACCEPTED BY IEEE TRANSACTIONS ON SIGNAL PROCESSING, JAN. 2013 



I 



Two-Way Training for Discriminatory Channel 
Estimation in Wireless MIMO Systems 

Chao-Wei Huang, Tsung-Hui Chang, Xiangyun Zhou, and Y.-W. Peter Hong 



o 

O 
(N 



> 

o 



X 



Abstract — This work examines the use of two-way training 
to efficiently discriminate the channel estimation performances 
at a legitimate receiver (LR) and an unauthorized receiver 
(UR) in a multiple-input multiple-output (MIMO) wireless 
system. This work improves upon the original discriminatory 
channel estimation (DCE) scheme proposed by Chang et al 
where multiple stages of feedback and retraining were used. 
While most studies on physical layer secrecy are under the 
information-theoretic framework and focus directly on the data 
transmission phase, studies on DCE focus on the training phase 
and aim to provide a practical signal processing technique to 
discriminate between the channel estimation performances (and, 
thus, the effective received signal qualities) at LR and UR. A 
key feature of DCE designs is the insertion of artificial noise 
(AN) in the training signal to degrade the channel estimation 
performance at UR. To do so, AN must be placed in a carefully 
chosen subspace based on the transmitter's knowledge of LR's 
channel in order to minimize its effect on LR. In this paper, 
we adopt the idea of two-way training that allows both the 
transmitter and LR to send training signals to facilitate channel 
estimation at both ends. Both reciprocal and non-reciprocal 
channels are considered and a two-way DCE scheme is proposed 
for each scenario. For mathematical tractability, we assume 
that all terminals employ the linear minimum mean square 
error criterion for channel estimation. Based on the mean 
square error (MSE) of the channel estimates at all terminals, 
we formulate and solve an optimization problem where the 
optimal power allocation between the training signal and AN 
is found by minimizing the MSE of LR's channel estimate 
subject to a constraint on the MSE achievable at UR. Numerical 
results show that the proposed DCE schemes can effectively 
discriminate between the channel estimation and hence the data 
detection performances at LR and UR. 

Index terms Two-way training, Channel estimation. Physical 
layer secrecy, MIMO 
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I. INTRODUCTION 

Due to the broadcast nature of the wireless medium, com- 
munication between wireless terminals is often susceptible to 
potential eavesdropping by unauthorized receivers. Therefore, 
as wireless technology becomes more prevalent, the need for 
discriminating between the signal reception performance at a 
legitimate receiver (LR) and that at an unauthorized receiver 
(UR) has increased. Motivated by this demand, the concept 
of physical layer secrecy have been studied extensively in 
recent years and methods that utilize properties of the wireless 
channels to achieve the desired performance discrimination 
have been proposed. From an information-theoretic viewpoint, 
the difference in the channel condition at different receivers 
can be exploited to ensure a non-zero communication rate 
between the transmitter and LR under the perfect secrecy 
constraint |[T]-|l4l, where the notion of perfect secrecy means 
that no UR is able to infer any information from the received 
signal broadcasted by the transmitter From a signal process- 
ing perspective, artificial-noise-aided multiuser beamforming 
schemes Q-Q and space-time coding schemes (H can be 
adopted to enhance signal reception at LR while Umiting the 
received signal quality at UR. 

A. Motivation and Related Work 

Most studies in the literature on physical layer secrecy, e.g., 
Q-Hl, focus on the design of the data transmission phase 
while often assuming the availability of perfect channel state 
information. In practice, channel knowledge is typically ob- 
tained through training and channel estimation, and its quality 
can have a significant impact on the receiver performance 
191 , ifTOl . Intuitively, if UR has a poorer channel estimation 
performance than LR, then UR would have a higher detec- 
tion error probability when overhearing the transmission of 
secret messages. Motivated by this observation, the authors in 
im proposed a novel training scheme, called discriminatory 
channel estimation (DCE), that can provide a better channel 
estimation performance for LR compared to that for UR. 
Different from the information-theoretic works [IT]-|l3], the 
DCE scheme in 1 1 1 1 takes a more practical viewpoint and 
is a signal processing technique for discriminating between 
the channel conditions of LR and UR. 

The key feature of the DCE scheme ifTTl is the insertion of 
artificial noise (AN) in the training signal. The AN is carefully 
placed in a subspace so that it jams UR while the interference 
caused to LR is minimized. To this end, the transmitter re- 
quires knowledge of LR's channel. In the original DCE scheme 
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proposed in IfTTI . this was achieved by using a prehminary 
training stage followed by multiple stages of feedback-and- 
retraining. Specifically, in the preliminary training stage, a 
pure pilot signal is first sent by the transmitter to allow for 
a rough channel estimate at LR. Then, LR feeds back this 
channel estimate to the transmitter, who then sends a new pilot 
signal inserted with AN to disrupt the channel estimation at 
UR. The pilot signal power used in the preliminary training 
stage must be relatively small since, otherwise, UR is also 
able to obtain a good channel estimate. In addition, the AN 
signal used in the retraining stage must be placed in the null- 
space of LR's estimated channel to minimize its effect on LR. 
Through each stage of feedback and retraining, knowledge of 
LR's channel at the transmitter is gradually refined, allowing 
AN to be placed more precisely in the desired signal subspace 
and the pilot signal power to be increased without benefiting 
the channel estimation at UR. The main drawback of this 
multi-stage feedback-and-retraining scheme [)1 Ij is the large 
training overhead and the high design complexity required. 
The cost and the quality of the feedback link may also limit 
its application in practice, but was not considered in lil li . 

B. Our Approach and Contribution 

The main contribution of this work is to propose new 
and efficient DCE schemes based on the two-way training 
methodology. The idea of two-way training, which has been 
studied for non-secrecy applications, e.g., in lfT9l - ll24l . allows 
both the transmitter and the receiver to send pilot signals in a 
collaborative manner so that channel estimation is enabled at 
both ends. This is particularly useful in achieving DCE since 
the reverse training signal sent by LR in a two-way training 
scheme will not benefit UR in obtaining any information about 
the channel between itself and the transmitteiQ. This advantage 
is not enjoyed by conventional one-way training schemes since 
any pilot signal sent by the transmitter will help UR improve 
its estimate of the channel between itself and the transmitter 
In this work, two-way DCE schemes are designed for both 
reciprocal and non-reciprocal channel models. The former is 
a reasonable model for time-division duplex (TDD) systems 
whereas the latter is often used to model frequency-division 
duplex (FDD) systems. For reciprocal channels, the proposed 
two-way DCE scheme requires only two stages of training, that 
is, a reverse training stage and a forward training stage. For 
non-reciprocal channels, only an additional round-trip training 
stage is needed, in which the transmitter first broadcasts a 
random signal only known to itself and LR echoes the signal 
back using an amplify-and-forward strategy. In both cases, AN 
is inserted into the pilot signal in the (final) forward training 
stage to achieve the desired DCE performance. Compared 
to the multi-stage feedback-and-retraining scheme proposed 
in ifTTl . the newly proposed two-way training schemes drasti- 
cally reduce the overall training overhead. 

The proposed two-way DCE schemes can conceptually be 
derived under any channel estimation criterion at the three 

'However, the revere training signal enables UR to estimate the channel 
between itself and LR. This may not be desirable if LR also has secret 
messages to transmit; see more discussions in Remark 5. 



terminals. For tractability and for gaining useful insights, in 
this paper, we assume that all terminals employ the linear 
minimum mean square error (LMMSE) channel estimator, and 
derive the resulting mean square error (MSE) of the channel 
estimates obtained at both LR and UR. These analysis results 
are then used to compute the optimal power allocation between 
the pilot signals and AN across different training stages. This is 
obtained by solving an optimization problem that aims to min- 
imize the MSE of LR's channel estimate subject to a constraint 
on the MSE of UR's channel estimate and individual training 
energy constraints at the transmitter and LR. For reciprocal 
channels, we show that the optimal transmit powers of reverse 
training, forward training, and AN have simple close-form 
expressions. For non-reciprocal channels, the power allocation 
problem cannot be easily solved, but an approximate solution 
can be obtained by employing the monomial approximation 
and the condensation method IZSl . 1261 often used in the 
field of geometric programming (GP). Numerical results are 
provided to demonstrate the effectiveness of the proposed 
schemes. 

The remainder of this paper is organized as follows. In 
Section [III the system model and the proposed two-way DCE 
scheme for reciprocal channels are presented. In Section Illl 
the two-way DCE scheme is extended to non-reciprocal chan- 
nels. Numerical results are provided in Section |IV] and, finally, 
the conclusion is given in Section [V] 

Notations: Upper-case and lower-case boldfaced letters are 
used for matrices, e.g., X, and vectors, e.g., x, respectively. 
Moreover, X^, X* and X^ denote the transpose, the complex 
conjugate and the Hermitian of the matrix X, respectively. 
Let Omxn be the A/-by-iV zero matrix and let Im be the 
M-by-Af identity matrix. Tr(-) denotes the trace of a square 
matrix, || • || denotes the Frobenius norm, and vec(-) is the 
operator which stacks the columns of a matrix into a vector 
The symbol (g) denotes the Kronecker matrix product, IE{-} 
denotes the expectation operator and diag(ai, . . . ,a]\[) is the 
N X N diagonal matrix with diagonal elements ai, . . . , a^. 

II. Two-Way DCE Design for Reciprocal Channels 

A. System Model 

Consider a wireless MIMO system that consists of a 
transmitter, a legitimate receiver (LR), and an unauthorized 
receiver (UrJI, which are equipped with Nt, Nl and Nir 
antennas, respectively, as shown in Fig. [T] We assume that 
Nt > Nl. Moreover, the channel fading coefficients are 
assumed to remain constant during each transmission block, 
which consists of a training phase and a data transmission 
phase. In this work, we focus on the training phase and aim 
to discriminate the channel estimation performances at LR and 
UR. Let the downlink channel matrix from the transmitter 
to LR be denoted by e c^*"""^. The entries of 
are assumed to be independent and identically distributed 
(i.i.d.) complex Gaussian random variables with zero mean and 

^For ease of presentation, we will focus on the scenario with one LR and 
one UR throughout the paper The presented methods, neveitheless, can be 
easily extended to the scenario with multiple LRs and URs, following the 
same spirit as in II II . 
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Fig. 1. A wireless MIMO system consisting of a transmitter, a legitimate 
receiver (LR) and an unauthorized receiver (UR). 



variance af^^ {i.e., CJ\f{Q,af^^)). Similarly, the uplink channel 
from LR to the transmitter is denoted by H„ e C^'^^^', 
whose entries are i.i.d. with distribution CA/'(0,ct^ ). The 
forward channel from the transmitter to UR is denoted by 
G g C^*^^^ with entries being i.i.d. CJ\f{0, a'^g); while the 
channel from LR to UR is denoted by F G C^^""^^'. In the 
rest of the paper, we assume that the transmitter, LR and UR 
are separated enough so that (H„), G and F are distinct 
and statistically independent of each other. 

In the case of reciprocal channels, the uplink and downlink 
channel matrices can be specified by a single channel matrix 
H such that H = = G ^n^xMl^ xhe variance of 
each entry can be denoted by ct^, where ct^ = crf^^ = ct^ . 
With such channel reciprocity, the transmitter can obtain an 
estimate of the downlink channel by taking the transpose of 
the estimated channel matrix obtained through reverse training, 
i.e., training based on the pilot signals sent from LR to 
the transmitter In the following subsections, we show the 
detailed steps of the proposed two-way DCE scheme and 
the associated LMMSE channel estimation performance in 
reciprocal channels. 

B. Training Scheme and Channel Estimation Performance 

Step I (Channel Acquisition at Transmitter via Reverse 
Training): The first step of the two-way DCE scheme is to 
allow the transmitter to obtain a reliable estimate of its down- 
link channel to LR without benefiting the channel estimation 
process at UR. Different from ifTTI . where the availability of a 
noiseless feedback link was considered, our proposed two-way 
DCE scheme requires the transmitter to estimate the downlink 
channel by itself through the exchange of training signals 
between the transmitter and LR. In the reciprocal channel case, 
this can be simply achieved by having LR send a reverse 
training signal to the transmitter Specifically, in the reverse 
training stage, LR first sends a training signal 



Xr = 



Nr 



••TrXNl 



(1) 



is the pilot matrix 



to the transmitter, where £ 
that satisfies C^Cl = Inl^ is the transmission power for 
reverse training, and t/j is the reverse training length. Note 
that the choice of using an orthonormal pilot matrix {i.e., 
Inl) is due to its optimality in minimizing the 



channel estimation error in point-to-point channels, as shown 
in ifTOI . In the remainder of this paper, we shall sometimes 
denote the reverse training energy by En while keeping in 
mind that 

£r ^ 'Prtr. 



The signal received at the transmitter is given by 



W, 



(2) 



where W g C^'' ^ ^' is the AWGN matrix with elements being 
i.i.d. CAA(0,a|). 

The reverse training signal sent by LR allows the transmitter 
to obtain an estimate of the downlink channel by taking the 
transpose of its upUnk channel estimate. By employing the 
LMMSE estimator 1271 . the estimate of H at the transmitter 
can be written as 



H = (a^Xf (a^XiXf + ap,J-^Y,)^ ^ H + AH (3) 



'if 



where AH £ C^'^^^ is the estimation error matrix with 



C{AH(AH)^} = iVi 



1 



'-Nti 



(4) 



and dl, is the noise power at the transmitter 

Step II (Forward Training with AN): After obtaining the 
downlink channel estimate, i.e., H, in Step I, the transmitter 
then sends a forward training signal to enable channel esti- 
mation at LR in Step II. To degrade the channel estimation 
performance of UR while not jamming LR, the transmitter 
carefully inserts AN in the training signal. The forward 
training signal is given by 



X,, = 



Ct 



AK|, 



(5) 



where Ct £ C^^^^' is the pilot matrix with Tr(Cf C*) = Nt, 
T'i? is the pilot signal power in this stage, and tf is the training 
length. For ease of notation, we define 

as the forward pilot signal energy. Here A £ C"^^^'^*"^^' 
is AN matrix whose entries are i.i.d. CAf{0,a^) and are 
statistically independent of the channels and noises at all 
terminals; Kg £ £,^tx{Nt-NL) jg ^ rnatrix whose column 
vectors form an orthonormal basis for the left null space of 



H, that is, KgH = 



— ' ' -{Nt-NL)xNL ^"-g^^H -ivf-i.i,- 

Notice from (|5j that AN is superimposed on the training 
signal and placed in the left null space of H to minimize 
its interference on LR. The received signals at LR and UR are 
respectively given by 



and Kg Kg = I 



Nt-NL- 




AKgH 



AKgG 

rl 



w, 



V, 



(6) 
(7) 



where W £ C^^^^^ and V £ C^^''^^ are the additive 
white Gaussian noise (AWGN) matrices at LR and UR, re- 
spectively, with entries being i.i.d. CAf{0, cr^„) and CA/'(0, cr^). 
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respectively. Note that, since H 
equation Q can be rewritten as 



H 



AH and KgH 



C,H 



AKgAH + W = CH 



W, 



0, 



(8) 



where C = ./4f and W = -AK^^AH + W. Also note 
that, due to the presence of A, W is statistically uncorrected 
with H, i.e., E{W^H} = 0. 

By assuming the LMMSE estimator, the channel estimate 
at LR can be expressed as 



Hi — RhC^(CRhC^ + R-w) 



(9) 



where _Rh = E{HH^} = NlctIIn^ and R^ = 
E{WW^}. The normalized mean squared error (NMSE) of 
Hi is defined as 



NMSEl 



Tr(E{(H-Hi)(H-HL)^} 



Tr 



NtNL 



NtNL 



(10) 



I 



By the fact that H and AH are uncorrected due to the or- 
thogonality principle [127], and by ^ and KgKg 
the covariance of W can be written as 

Rw = (E{||K|AHf I'T^ + NLal) I,, 



Nr 



(Nt - Nl) 



'H 



NL<yl 



(11) 



Then, by substituting ( fTTT i into ( fTOl i, we have 



NMSE,^-Tr^-,I., 



Nt 



{Nt^NL){^ + 



(12) 



Similarly, the NMSE of the estimate of G at UR can be 
computed as 



NMSEr/ = 




Nt {Nt - Nl^IcjI + al 



(13) 



Notice, from (fT2] i and (flJT l, that increasing the AN power {i.e., 
(7^) increases the NMSE at both receivers, but the effect can 
be reduced at LR by increasing the reverse training energy 
£fi. Hence, under a total energy constraint, the training and 
AN powers must be carefully chosen to ensure sufficient 
discrimination between the channel estimation performances 
at the two receivers. 

Remark 1. In view of the fact that the proposed forward 
training signal in ^ contains AN, an important question to 
ask is that whether UR can ignore the AN-aided training signal 
and directly employ some blind data detection or channel 



estimation methods in the data transmission phase. For exam- 
ple, if the transmitter uses space-time coding schemes in the 
data transmission phase, UR may employ the blind detection 
methods in lfT2l - lfT4l or the blind channel estimation methods 
in ifTSl . lITSl . However, these blind methods cannot work 
properly without cooperation of the transmitter Specifically, if 
the transmitter uses space-time codes that are not identifiable 
lfT6l . ITtI , UR would suffer from nontrivial code and channel 
rotation ambiguities. It has been shown in 111 8] that the rotation 
ambiguities can make UR have a detection error probability 
equal to one, provided that the information symbols satisfy a 
constant modulus propertjH As a result, UR may still need 
to exploit the training signal for channel estimation, even 
though it is jammed by the AN signal. Further quantitative 
analysis evaluating the performance of other data transmission 
schemes and blind detection/channel estimation methods under 
the proposed DCE scheme will be an interesting direction for 
future research. 

C. Optimal Power Allocation between Pilot Signal and AN 

The closed-form NMSE expressions obtained in the previ- 
ous subsection show explicitly the effect of the reverse training 
power (or energy), the forward training power (or energy) 
and the AN power on the channel estimation performances 
at LR and UR. From a designer's point of view, it is desirable 
to utilize the available power (or energy) in an efficient 
way whilst achieving a satisfactory DCE performance. In the 
following, we consider the problem of allocating the pilot 
signal and AN powers in the reverse and forward training 
stages with the goal of minimizing the channel estimation error 
at the LR whilst keeping the estimation error at UR above a 
certain threshold. The proposed optimization problem is given 
as follows: 



min NMSEl 



subject to (s.t.) 



NMSEcT > 7, 
Sb<£l, 

£f + [Nt - Nl)<jItf < Bt 



(14a) 

(14b) 
(14c) 
(14d) 



The optimization problem is constrained by a required lower 
limit on UR's NMSE in ( I14bb . and individual energy con- 
straints at LR and the transmitter in (114c) and (ll4dK respec- 
tively. 

It should be noted that UR's NMSE constraint, i.e., 7, mi 
should be chosen such that 



1 



< 7 < 



(15) 



where the term on the left hand side is the minimum achievable 
NMSE at UR (when the transmitter does not use any AN, i.e., 
cr^ = 0) and the term on the right hand side stands for the 
worst NMSE performance at UR (which is achieved when the 

^In addition, the transmitter may employ the AN-aided OSTBC scheme in 
[8] or the AN-aided beamforming schemes (5]-G]- Since the data signals of 
these schemes also contain AN, it would be even more difficult for UR to use 
the blind methods to extract the information symbols or estimate the unknown 
channel. 



ACCEPTED BY IEEE TRANSACTIONS ON SIGNAL PROCESSING, JAN. 2013 



5 



mean of G, i.e., zero, is taken as the channel estimate). For 
ease of use later, let us define 

7 = - ^) > (16) 

so that the condition in ( fTSl l can be reduced to 

< 7 < £t. (17) 

The power allocation problem in (fl4l i is a non-convex 
optimization problem involving three variables (£/?, f^, cr^)- 
Interestingly, we show in the following proposition that, for the 
case of orthogonal forward pilot matrices (i.e., the case where 
CfCt = IatJ, problem (fT4l i actually has simple closed-form 
solutions. 

Proposition 1. Consider problem ( 114b with 7 chosen accord- 
ing to ([T5]l. If 

f,^NL f 44 - 4) > Sl, (18) 

the optimal {£r,£f,cf'^) is given by £^ ~ 0, £p = j and 
(ct^)* = (i.e., no need of reverse training and AN). On the 
other hand, if ^ < £l, the transmitter and LR use all the 
available energy so that = £l, 

Op — Ct — 5-7—5-, 

and 

Proposition [1] implies that, if UR has a relatively poor 
channel condition (i.e., sufficiently small a'g/al), then both 
AN and reverse training are not needed; otherwise, LR should 
use all its energy for reverse training and the transmitter needs 
to employ AN in order to constrain UR's MSB above the 
required threshold value 7. The proof of Proposition [T]is given 
in Appendix [A] Appendix |A] actually provides the proof for a 
more general formulation which, compared to problem (IT4l . 
has an additional total energy constraint 

£r + £f + (Nt - NL)alTF < £tou 

where £tot represents the total energy budget. This total energy 
constraint limits the total amount of energy consumed by the 
transmitter and LR in the training phase. We are interested 
in such general formulation because it may be useful in the 
system design stage for understanding the power tradeoff 
between the transmitter and LR and that between the training 
phase and data transmission phase. 

Two remarks regarding the proposed DCE scheme are in 
order 

Remark 2. It is interesting to note that Proposition [T] gives 
the solution to the optimization problem for the orthogonal 
forward training matrix with full rank, i.e., C^Ct = IjVf 
However, the rank of Ct does not need to be Nt in gen- 
eral. Given that the rank of Cj is equal to K{< Nt), it 
is shown in Appendix |B] that the optimal Ct must satisfy 
CfCt = UcDUf, where D = diag(di, . . . , dif , 0, ...,0) 



with di = ■ ■ ■ = dx = ^ and is an Nt x Nt unitary 
matrix. If a rank deficient pilot matrix is considered instead, 
i.e., K < Nt, one can choose an arbitrary Nt x Nt unitary 
matrix for Uc and obtain the optimal {£ii, £f, cr^) for a given 
K value by following the same derivations as in the proof of 
Proposition [T] The rank of the forward training matrix can be 
further optimized to minimize the NMSE at LR. 

Remark 3. The training lengths in the reverse and forward 
training stages, i.e., tb, and tf, can also be optimized. Note 
that training on the reverse link is not affected by the presence 
of UR and, thus, can be viewed as training on a point-to-point 
link. Therefore, by ifTOl . the optimal reverse training length 
Tfi is equal to the number of antennas at LR {i.e., Nl) since 
it minimizes the training overhead without compromising the 
channel estimation performance. One can also show that the 
optimal forward training length tf is given by the number of 
antennas of the transmitter, i.e., Nt. To show this, observe that 
the optimal (f j^, £*p, af) of dHl for some Tp = Tp^ satisfies 
the constraints of (fT4l i even when Tp reduces to a smaller value 
tfi < Tp^, i-e., {£'^, £f, (t^ ) is also feasible to (O with 
Tp = Tp^. This implies that a smaller Tp corresponds to a 
larger feasible set for {£r, £f, <^a) ™ 'O; and thus a smaller 
value of optimal NMSE^ can be obtained. Consequently, 
the minimum NMSE^ is achieved when Tp is equal to its 
minimum possible value, i.e., the number of antennas at the 
transmitter Nt- 

III. Two-Way DCE Design for Non-Reciprocal 
Channels 

In this section, we consider the case of non-reciprocal 
channels. Without channel reciprocity, the downlink channel 
gain cannot be directly inferred from the uplink channel 
gain. In this case, the knowledge of the downlink channel 
at the transmitter can be obtained via reverse training plus 
an additional round-trip training stage that utilizes an echoed 
signal (from the transmitter to LR and back to the transmit- 
ter) to obtain the combined downlink-uplink channel at the 
transmitter The proposed two-way DCE training scheme is 
detailed below. 

A. Training Scheme and Channel Estimation Performance 

Step I (Channel Acquisition at the Transmitter with 
Round-Trip and Reverse Training): In the round-trip train- 
ing stage, the transmitter first broadcasts a random signal, only 
known to itself, which is then echoed back to the transmitter by 
LR. The effective channel seen at the transmitter is equal to the 
composition of the uplink and downlink channels. Specifically, 
let Cto G c^toxJVt j.jjg pijot matrix that is randomly 
generated with normalized power, i.e., Tr(C^Cfo) = Nt- The 
signal sent by the transmitter is given by 

X,o = /^C,o, (19) 

where Vto is the pilot signal power and Tto is the training 
length in this stage. Note that Cto is known only to the 
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transmitter (but not to LR or UR0) since it is randomly 
generated before each transmission. For ease of notation, we 
define the round-trip training energy as Eto = VtoTto- The 
received signal at LR is given by 



Y 



LO 



Wo, 



(20) 



where Wq G C'"^^^ is the AWGN matrix with entries that 
are i.i.d. with distribution CA/'(0,cr^). Upon receiving Y^o, 
LR amplifies and forwards it back to the transmitter. The 
echoed signal at the transmitter is given by 



Yti = aYioH„ + Wi 



Wi, 



(21) 



where Wi G C^"'^^' is the AWGN matrix at the transmitter 
with entries being i.i.d. with distribution C7V(0, cr,|). The 
amplifying gain at LR is given by 



a 



VliTw 



(22) 



where Vli is LR's transmission power and £li ^ VliTio is 
the energy spent on echoing the signal. With the knowledge 
of Xto, the transmitter can obtain an estimate of the combined 
downlink and uplink channels, i.e., HdH„. However, to obtain 
an estimate of the downlink channel H^, the transmitter must 
first obtain an estimate of the uplink channel H„. This can 
be achieved through reverse training, which is the same as the 
one described in the reciprocal channel case. 

In the reverse training stage, LR sends a training signal 
Xl2 = \f^CL2 e C^^-^""^^ to enable upHnk channel 
estimation at the transmitter. Here, is the pilot matrix 
satisfying Cfj^^ia = IjVi, £l2 is the reverse ti-aining energy, 
and is the training length. The received signal at the 
transmitter is given by 



Xl2H,, 



Wo 



Cl2H„+W2, (23) 



where W2 is the AWGN matrix with entries being i.i.d. with 
distribution CN{Q,a%). The transmitter can then obtain the 
LMMSE estimate of the uplink channel as 



Similar to that in (|3]l and (HJi, we can write 
H,, = H,, + AH,, , 



(24) 



(25) 



UR may attempt to exploit the AN-free signal Cto to obtain some useful 
information about its channel G. For example, if Njj > Nt or the distribution 
of Cto is known, UR can estimate the subspace of G from the received 
signal. However, like the blind methods discussed Remark 1, this subspace 
information still suffers from a nontrivial rotation ambiguity about the true 
channel G. Besides, the subspace estimation quality could be very poor due 
to the short length of Cto and the presence of additive noise. 



where AH„ is the estimation error matrix which is complex 
Gaussian distributed with zero mean and correlation matrix 



E{AHf(AH„)}=ivJ-^ 



L2 



(26) 



Note that AH,i and H„ are statistically independent since they 
are both Gaussian and are uncorrected with each other due 
to the orthogonality principle ||271 . The transmitter can then 
utilize the uplink channel estimate H„ to compute an estimate 
of the downlink channel H^. 

Specifically, given H„ at the transmitter, we can rewrite the 
echoed signal in ( 1211 1 as 

Yii = aX4oHd(H„ - AHJ + aWo(H„ - AH„) + Wi 
= aXtoHdH,, + (aWoH„ - aXtoHdAH„ 

-aWoAH„ + Wi). (27) 

For ease of analysis, let us define yti = vec(Yti), ~ 
vec(Hrf), Wq = vec(Wo), and wi = vec(Wi) as Jhe 
respective vector counterparts of Yfi, H^, Wq and Wi 
obtained by stacking the column vectors of each corresponding 
matrix. By the Kronecker product property that vec(ABC) = 
(C-^ ® A)vec(B), one can express (|27T i as 



yti = a(Hj ® Xto)hrf H 
-a(AHj«)Xto)hd 



- a(AH^ ® IatJwo 



wi. (28) 



Let Cto be ^ unitary matrix such that CtoC^ = C^Cto = 
Iwjfl By the fact that the equivalent noise term q;(HJ ® 
lArJwo-a(AHj(g)Xto)hd-a(AHj®lArJwo + wi in ^ 
is statistically uncorrelated with h^, the LMMSE estimate of 
the downlink channel at the transmitter (denoted by h.d,t) 
can be computed as 



h: (h^h:)+/?i 



Nt 



= hd + Ah, 



(29) 



where 



£l2 



--to 



(30) 

and Ahd,t G ^NtN^xi jj^g estimation error vector at the 
transmitter. The corresponding matrix form of f is given 



^Note that the pilot matrix Cto need not be a unitary matrix in general. 
However, the NMSE performance obtained with a generic round-trip pilot 
matrix cannot be expressed in a closed form and, thus, the optimal pilot 
structure is difficult to obtain. In this work, we aim to provide a design 
that can be efficiently implemented and whose performance can be easily 
characterized. With the choice of unitary pilot matrices, we are able to derive 
an accurate closed-form approximation of the NMSE performance and further 
utilize it to efficiently optimize the power (energy) allocation among the pilot 
signals and AN in different stages. 
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by 



xXgYn ((HfHu) 
Hd + AH.t, 



H 



H 



(31) 



where and AH^ t is the matrix form of he; and Ahrf t, 
respectively. The covariance matrix of Ah^ t conditioned on 
H„ is given by 



E{Ahd,t{Ahd,t)"\nu} 



-to 



-Hr.Hf 



I 



N, 



® I 



(32) 



Step II (Forward Training with AN): In the forward 
training stage, the transmitter sends AN along with the training 
signal to discriminate the channel estimation performances 
between LR and UR. The detailed description has been 
presented earlier in Section II-B. For notational consistency 
in this section, we modify the subscripts of the symbols in the 
expressions of the received signals at LR and UR as 




AK 



+ w. 



AKg G 

tid.t 



(33) 



(34) 



and the forward training length is denoted as Tts instead of 

Tp. 

Comparing with the design for reciprocal channels de- 
scribed in Section HI] the DCE in the case of non-reciprocal 
channels requires more time to complete since an additional 
round-trip training stage is used. To keep the training overhead 
low, we consider a design with the minimum training lengths, 
i.e., Tto = Nt, TL2 = Nl, Tt3 = Nt, and choose Cto to be a 
unitary matrix such that C^Cjo = CtoC^ = Note that 
the choices of tl2 = and Tts = Nt are indeed optimal 
under a total and/or individual energy constraints as discussed 
in Remark |3] of Section [III 

Unlike the reciprocal channel case in Section III-BI it is 
difficult to obtain a close-form expression for the NMSE at LR 
from ( |33] |. In Appendix |C] we instead derive an approximate 
NMSE at LR by assuming that: 1) given H^, the LMMSE 
estimate of in dsTl i. i.e., H^ t, is statistically independent 
of the associated error matrix AH^ t; 2) the transmitter and LR 
have sufficiently large numbers of antennas, i.e, Nt, Njj ^ 1. 
The obtained approximation of the NMSE at LR is given by 
i35[ which is at the top of the next page, where 



2 A 



at £. 



L2 



'yi£L2 + NL<7i 



(36) 



and /3, as recalled from (|30] |. is a function of the energy values 
£ta and £l2- While the two assumptions are in general not true 
(in general H^,* and AH^ t are only statistically uncorrelated, 
and Nt and Njj are finite), our numerical results show that 
the approximate NMSE^ presented above is actually quite 



accurate under practical settings, as will be shown in the 
numerical result section. 

The NMSE performance at UR can be computed in a similar 
fashion as in the reciprocal case, which is given by 



NMSE 




Nt {Nt 



(37) 



B. Optimal Power Allocation between Training and AN Sig- 
nals 

The effect of power allocation on the NMSE performance is 
much more complex in the non-reciprocal case. Nonetheless, 
we can formulate an optimization problem, similar to that in 
the reciprocal case, where we aim to minimize the channel 
estimation error at LR whilst keeping the estimation error at 
UR above a threshold. The optimization problem is given as 
follows: 



mm 

£L2,£t3,crl>0 
S.t. 



NMSEl 

NMSEj/ > 7 

£to + £t3 + {Nt - 

£li + £l2 < £l- 



NLhlNt < £t 



(38a) 

(38b) 
(38c) 
(38d) 



Here, £t and £l are the individual energy constraints at the 
transmitter and LR, respectively. Since this problem is not 
easily solvable, we resort to the monomial approximation and 
the condensation method often adopted in the field of GP ||25]| . 
1261 to obtain an efficient solution. Detailed description of the 
numerical algorithm can be found in |i28| and are omitted in 
this paper since these approaches are standard in the field of 
GR 

Remark 4. Similar to the discussion in Remark |2] of Sec- 
tion [III one can also show that the optimal structure of Ct3 is 
given by C^Cta = UoDUg, where D = diag((ii, . . . , d^, 
0, . . . , 0) with di = ■ ■ ■ = (Ik ^ and is the matrix 
whose columns consist of eigenvectors of C^C^. For any 
choice of K{< Nt), the optimal power allocation can be 
obtained by performing the same monomial approximation 
and condensation method described in 1281 . The optimal K 
can then be found by comparing the solutions for all possible 
values of K. 

Remark 5. As the final remark, it would be interesting to 
qualitatively compare the proposed two-way DCE scheme and 
the original feedback-and-retraining DCE scheme in ifTTI . In 
terms of training overhead, it is easy to see that the proposed 
two-way DCE scheme is more efficient than the feedback- 
and-retraining DCE scheme, since the two-way DCE scheme 
requires at most three transmissions by the transmitter and/or 
LR (e.g., for the non-reciprocal channels, we require one 
round-trip transmission, one reverse training and one forward 
training with AN) while the feedback-and-retraining DCE 
scheme, even under the assumption of ideal feedback, usually 
requires around five stages of feedback and retraining (equiva- 
lent to 10 transmissions by the transmitter and LR) in order to 
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NMSEi « — Tr 

Nt 



'-Nt 



■■a 



Nt 



(35) 



achieve a comparable performance |fTT|. We should mention 
that, if the goal is solely to prevent UR from obtaining a good 
estimate of its channel from the transmitter, the feedback- and- 
retraining scheme actually provides no advantages over the 
two-way DCE scheme, even though it requires more complex 
operations. However, the two-way DCE scheme cannot be 
applied if the channel between LR and UR is also of interest 
at UR, e.g., when LR also has a secret message to transmit. 
In this scenario, the feedback-and-retraining scheme in lUTl 
would be the preferred method. In summary, the two DCE 
schemes actually have their own values and limitations and 
should be deployed depending on the applications. 

IV. Numerical Results and Discussions 

In this section, we present numerical results to demonstrate 
the effectiveness of the proposed DCE schemes. We consider a 
MIMO wireless system as described in Section lTl-Al with Nt = 
4, Nl = 2 and Njj = 2. The elements of the channel matrices, 
H, H„, Hd and G, are i.i.d. complex Gaussian distributed 
with zero mean and unit variance, i.e., cr^ = <j\ — = 
(Tg = 1. The entries of the receiver noise matrices, i.e., W, W 
and V, are also assumed to be i.i.d. complex Gaussian with 
zero mean and unit variance, i.e., = cr^, = cr^ = 1. The 
orthogonal forward pilot matrix is employed, i.e., CfCt = 
C^Ci3 = Itvc Moreover, the training lengths are set to its 
minimum, that is, tu — Nl = 2 and t^? = A'^f = 4 for 
the reciprocal case, and Tfo = Tfi — Nt ^ A and ~ 
Nl ~ 2 for the non-reciprocal case. Let us denote the total 
training length spent by the transmitter as tj and that by LR 
as tl- For the reciprocal case, = 4 (= Tp), tl = 2 {= 
Tfj), and for the non-reciprocal case, Tt = 8 (= Tto + Tts), 
Tl = 6 (= Tto + tl2)- Note that the overall training time 
would be longer than the sum of all training lengths due to 
the processing time at the transmitter and LR. In order to 
investigate the energy tradeoff between the transmitter and LR, 
we consider the general formulation as in (l4&t which has the 
additional total energy constraint in contrast to ([14) and (l38T l. 
We define the average transmit power as Pave — Tt+TL 
that a total energy budget can be alternatively expressed in 
terms of an average power budget. For all simulation results, 
the individual power constraints at the transmitter and LR are 
respectively given by = |j- = 30 dB and Pl = |^ = 20 dB 
(relative to its noise variance). We incorporate an NMSE lower 
bound for comparison. The lower bounds for the reciprocal and 
the non-reciprocal cases are both given by 

This is the minimum achievable NMSE at LR when cr^ ~ 0, 
i.e., no AN is used. 




Pave (dB) 
(a) Reciprocal case 




Pave (dB) 



(b) Non-reciprocal case 

Fig. 2. Power allocation between the pilot signal and AN powers of the 
proposed DCE schemes for the reciprocal and the non-reciprocal cases. 

In Fig. [2] we show the results of the optimal power 
allocation among pilot and AN signals in different stages of the 
training process for both the reciprocal and the non-reciprocal 
cases. The results are shown for two different lower limits 
on UR's NMSE, i.e., 7 = 0.1 (indicated by solid lines) and 
7 = 0.03 (indicated by dashed lines). In the reciprocal case, 
the reverse training power, forward training power, and the 
AN power are defined as Sr/tr, £f/tf and {Nt — -/Vi)(T^, 
respectively. We can see from Fig. |2(a)| that, as the average 
power budget Pave increases, all the training powers and 
the AN power increase at roughly the same rate over the 
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range of Pave from 18 dB to 26 dB. This suggests that the 
optimal percentages of total energy allocated to the reverse 
training, forward training, and AN do not change much over 
a wide range of energy budget. However, when Pave becomes 
sufficiently high, the curves in Fig. |2(a)| start flattening out 
due to the individual power constraints at the transmitter and 
LR. Moreover, by comparing the optimal power allocation of 
the case with 7 = 0.03 and that with 7 = 0.1, we can see 
that it is desirable to allocate more power to AN and less 
power to the forward pilot signal as 7 increases (i.e., when 
a stricter constraint is imposed on UR's performance). This 
is due to the fact that the forward pilot signal benefits both 
LR and UR while AN primarily degrades UR's estimation. 
However, an increase in the AN power may also degrade the 
estimation performance at LR if it is not placed accurately 
in the null space of the channel to LR. To reduce this effect, 
the reverse training power should be increased in order to 
obtain a more accurate knowledge of the downlink channel. 
This explains why the reverse training power increases with 
7 in Fig. |2(a)| Note that, when Pave falls below 10 dB, the 
value of 7 = 0.03 falls out of the feasible range given in (flSl l 
(because the value of 7 is not achievable by UR even without 
AN) and, therefore, is not shown in the figure. Similar trends 
also hold in the non-reciprocal case as shown in Fig. |2(b)| 
where Eto/rto and Sli/tio are the round-trip training powers, 
£l2/tl2 is the reverse training power, Ea/rt^ is the forward 
pilot signal power, and {Nt — Nl)ij1^ is the AN power Notice 
that, since the round-trip and the reverse training work together 
in this case to provide the transmitter with knowledge of the 
downlink channel, both powers should be increased to reduce 
AN's interference on LR. 

In Fig. [3] we show the channel estimation performance 
at LR and UR for different values of average power budget 
Pave - The reciprocal case is shown in Fig. |3(a)| while the non- 
reciprocal case is in Fig. |3(b)| Two different lower limits on 
the UR's NMSE are considered in the figures, i.e., 7 = 0.1 and 
7 = 0.03. We see that our proposed DCE schemes can indeed 
constrain the UR's NMSE above 7 in both cases. In the case of 
non-reciprocal channels, we also compare the approximation 
of LR's NMSE obtained in ( |35] ) with the exact value obtained 
from Monte-Carlo simulations in Fig. |3(b)| We can see that 
the analytical approximation of the NMSE is very close to that 
obtained from Monte-Carlo simulations. 

Finally, in Fig. 2] we show the symbol error rate (SER) 
performance at LR and UR in the data transmission phase. We 
consider the scenario where the transmitter sends a 4 x 4 com- 
plex orthogonal space-time block code (OSTBC). The code 
length is equal to four and each code block contains three 64- 
QAM source symbols [59 1. The data transmission power is set 
equal to the average transmit power budget Pave- Note that this 
4x4 OSTBC is not identifiable 113, flSl, which, as discussed 
in Remark 1, implies that UR would suffer from non-trivial 
rotation ambiguities when either blindly estimating the channel 
or detecting the codeword. Therefore, it is assumed here that 
both LR and UR exploit their channel estimates obtained under 
the proposed DCE scheme, and detect the data symbols by 
using the coherent maximum-likelihood detector (assuming 
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(a) Reciprocal case 




-e-UR, 7 = 0.1 

-B- LR, Monte Carlo, 7 ■- 
LR, approx, 7 = 0.1 
■©-UR, 7 = 0.03 
- B - LR, Monte Carlo, 7 = 
■X-LR, approx, 7 = 0.03 
-Af- NMSE lower bound 
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Pave (dB) 
(b) Non-reciprocal case 

Fig. 3. NMSE performance of the proposed DCE schemes for the reciprocal 
and the non-reciprocal cases. 

that the obtained channel estimates are perfeclH) ||29ll . In this 
Monte-Carlo simulation, the SER is computed by averaging 
over 500, 000 channel realizations and OSTBCs. 

Fig. [4(a)l presents the SER for 64-QAM OSTBC in the 
reciprocal case. We see that the SER at LR gradually improves 
as the average power budget increases, while the SER at UR 
remains larger than 0.1 due to the poor channel estimation. 
Similar trends are also observed in the non-reciprocal case 
in Fig. |4(b)| Both figures illustrate that, with the proposed 
DCE scheme, discrimination of the data detection perfor- 
mances between LR and UR can be effectively achieved. It is 
worthwhile to mention that the feedback-and-retraining DCE 
scheme proposed in ifTTl assumed a perfect feedback channel 
with no power consumption and, thus, it is difficult to have 
a fair performance comparison between the proposed scheme 
and that in ifTTI . 

''LR and UR can also employ the CSI-error-aware detector in II30I Eqn. 
(20)], despite that the associated complexity is much higher especially for 
higher-order QAM. It is anticipated that both LR and UR would have 
improved symbol en'or performance. 
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(a) Reciprocal case: 64-QAM OSTBC 
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(b) Non-reciprocal case: 64-QAM OSTBC 

Fig. 4. SER perfoiinance of LR and UR in an OSTBC system witli ctiannel 
state infomiation (CSI) obtained by the proposed DCE schemes. 



V. Conclusions 

In this paper, we have proposed new DCE schemes based on 
the two-way training methodology for both reciprocal and non- 
reciprocal channels. The proposed design drastically decreases 
the overall training overhead compared to the original DCE 
scheme in fTTj . We obtained analytical results on the MSE 
of the channel estimation and utilized it to derive the optimal 
power allocation among pilot signals and AN. The optimal 
power values in both cases were obtained by minimizing 
the MSE at LR whilst confining the MSE at UR above 
some prescribed value. The presented numerical results have 
demonstrated the effectiveness of the proposed DCE schemes. 

In the current paper, we have derived the DCE scheme 
based on the LMMSE channel estimator. Since UR may not 
be restricted to the use of the LMMSE channel estimator, it 
would be interesting to extend the DCE scheme to other more 
complex channel estimators or even using Cramer-Rao lower 
bound (CRLB) as the performance measure. Furthermore, by 
intuition, the channel condition difference caused by DCE 
should improve the achievable secrecy rate defined in the 
context of information-theoretic security ||T]-||3]. Analytically 



proving this intuition, though challenging, is an interesting 
future research direction. Interested readers may refer to |f35l- 
1421 for some endeavors which aim to characterize the impact 
of channel estimation errors at terminals on the achievable 
secrecy rate. 
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Appendix A 
Proof of Proposition[T] 

In this appendix, we present the solutions for the following 
problem 



mill NMSEz, 



(40a) 
(40b) 



subject to (s.t.) NMSEt^ > 7, 

£r + £f + {Nt - Nl)<jItf < Etot, (40c) 

£p + {Nt - NL)alTp < £t, 



(40d) 
(40e) 



where (I40cb is a total energy constraint and £tot denotes the 
total energy budget. Note that, when £tot > £l + £t, the total 
energy constraint (I40cl i is redundant, and thus (l40t reduces 
to ( fT4l i. The solutions of ( |40] | are given in the following 
proposition. 

Proposition 2. The solutions to (I40l l with 7 chosen according 
to ( 115b are given by considering the following three scenarios 
separately. 

Scenario 1 (Btot > £l + £t)' For this scenario, problem ( 1401 ) 
reduces to problem ( 1141 ). The corresponding solution is given 
in PropositionUl 

Scenario 2 (raax{£L, £t} < £tot 1^ £l + £t)- If 
p > min{fL, £tot ~ 7}, 

the optimal (£r,£f,u'^) is given by £^ ^ 0, £p = j 
and (cTq)* = (i.e., no need of reverse training and no 
need of AN in the forward training). On the other hand, if 
p < minji^L, Btot — 7}, the optimal £fi can be obtained by 
solving the following one-dimensional optimization problem: 



F* 



{NLal+al£R)-£F{£B, 
arg max = — 

NLcrl+al-£n + N,ay^- 



s.t. max{0, /i, £t, 



where 



and 



£f{£r) 




(41) 

£t} < £r 

< mm{£L, Btot - 7}, 



(42) 



(43) 



ACCEPTED BY IEEE TRANSACTIONS ON SIGNAL PROCESSING, JAN. 2013 



11 



The optimal Sp and cr^ are given by £p — £f{S}i) and 

Scenario 3 (Etot < and/or Stot < £t)' This scenario refers 
to the case where one or both individual energy constraint(s) 
are redundant. Here, the solution is given as in Scenario 2 with 
the redundant individual energy constraint(s) set to infinity. 

We first prove the most general scenario, i.e.. Scenario 
2, where both the individual and total power constraints are 
effective. Scenario 1 and scenario 3 are degenerated cases of 
Scenario 2; their corresponding solutions are presented in the 
second subsection. 

A. Proof of Scenario 2 

For notational simplicity, let us define ( = {Nt ~ Nl)(j'^. 
In this case. Problem (fl4] | can be rewritten as 



(44a) 

(44b) 

(44c) 
(44d) 
(44e) 



£r + £f + C ■ tf < £tot, 
£r<£l 

£f + C ■ tf < £t- 

We will analyze the solutions of ( |44] | via the following two 
steps: (i) for any given £fj, find the optimal values of £p and 
C, as functions of £]i; and (ii) find the optimal value of f/j. 
Step (i); Given £b, where < £r < nim{£L,£tot}, the 
optimal values of £p and ( can be found equivalently by 
solving the following optimization problem: 

{NLal+al£j,)£F 



s.t. 



(^'^,£f 



<7, 



£f + C ■ '''F < £tot — £r, 

£f + C ■ Tf < £t- 



(45a) 

(45b) 

(45c) 
(45d) 



Let the solutions to and ( in the above problem be denoted 
by £p{£j^) and <^*{£fi), respectively. To analyze (l45T l. we 
consider the following two cases: 

Case 1 {£tot — 1 < £r < niin{fi, ftof}): Note that the ob- 
iective function in (145b is monotonicallv increasing in £f but 
decreasing in C,. Since 7 satisfies ( [TtI i and £r > £tot — 7, it 
follows that C,*{£r) = and £p{£Fi) = £tot — £r- It can be 
easily verified that C,*{£fi) and £f{£r) are feasible to (|45] |. In 
particular, (145 dl i is satisfied due to £f{£r) = £tot — £r and 
7 < ft by ([TtT i. In this case, the maximum objective value of 
( 145 al ) is equal to £tot — £r, which is less than 7 due to the 
premise of this case. 

Case 2 {£r < minify, £tot — j})- In this case, we first show 
that the constraint in ( I45bb must hold with equality when 
the optimum value is achieved. Suppose that the constraint in 
i45H is inactive at the optimum. In this case, we can always 
decrease C to obtain a larger objective value until the constraint 
(I45bl) holds with equality. If the condition ( I45bb is still inactive 



even when C = 0, we can instead lift £f to achieve a larger 
objective value since < min{£i, £tot ~ 7} and 7 must 
satisfy iVh . Hence, we conclude that the constraint in (I45bl ) 
must hold with equality and, thus. 



£Fi£R) = 7 



Ci£R) + 1 



(46) 



By substituting ( 1461 ) into ( 1451 ). the optimization problem can 
be reduced to 



ial/al-C + l){NLal+al£R)^ 
max — 2 

NL<yl^C + Ni.'yl + <yl£R 

£r - 7: 



S.t. 



(47a) 



(47b) 



(47c) 



We further consider the following two subranges of £fj in 
order to find the optimal C,. Let us iterate the definition of /i 
in ([H 




f I > min{£L, £tot - 7}- 



(a) £fi < /i: It can be shown that the objective function in 
(I47ab is monotonically decreasing in C whenever £fj < 
fi. Therefore, for £11 < ji, the optimal C, of ( |47] | is zero 
and the corresponding optimal objective value is given 
by 7. 

(b) /i < £r'. On the other hand, when /i < £11, the objective 



function in ( 147 ab is monotonically non-decreasing in C,. 
Hence, if £11 < £tot — £t, one can increase C until 
constraint (I47cb is met with equality. In this case, we 
have 

C{£r) = ^*~7, (48) 

TF + (Tglloi 

Conversely, if £fi > max{/i, £tot-'£t}, constraint (I47bb 
must hold with equality at the optimum and, thus, 

£tot — £r 



Ci£R.) 



Moreover, since £/: 



-22 

TF + (Jnl/cr^ ' 



(49) 



'■-R > (J- implies that cFg/af, > 
^'^'^■i '^j^Ji'g^ , the optimal objective value of ( 147 al ). i.e.. 



(50) 



is no less than 7. 
Notice that the maximum objective value obtained in Case 2 
is no less than 7 and, thus, is always greater than that obtained 
in Case 1. 

Step (ii) : Since the maximum objective value in Case 2 is 
always greater than that in Case 1, the optimal value of £11 
must satisfy £r < mm{£L, £tot — 7}- 

If ^ > minjiSi, £tot — 7}, then it follows that £fi < fi 
since £fi < mm{£L, £tot ~ 7}- Therefore, by the results 
of Case 2(a), we have C* = and, thus, = 7 by ( l46b . 
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Since AN is not needed, there is also no need for reverse 
training and, thus, we can set £^ = 0. Ahematively, if 
fj, < min{i?i, £tot — 7}, £r can be chosen to be either 
greater or smaller than ^. However, by the results of Case 
2, we know that a larger objective can be achieved when 
£r > Therefore, the optimal Er must lie in the range 
max{0, fi} < £fl < mm{£L, Stot - l}- 

Let us first consider the subrange max{0, /i} < Er < £tot — 
£t (< £l), if it exists. In this case, the optimal values of 
£p and C, are given by ( |46] l and ( |48] l. which actually do not 
depend on Er. By replacing £p and ^ with their optimal values 
£r{.£f) and C,{£f), the original problem (l44l i can be written 
as 

{NLal+al£R)£U£R) 



max 



NLal+al£R + NLaf^(:*{£R) 
s.t. max{0, ^} < £r < £tot ~ £t- 



(51) 



Notice that, since £p{£r) and C{£r) in ( l46l ) and ( |48] | do not 
depend on £r, the objective function (ISTT i is monotonically 
non-decreasing with respect to £r and, thus, the optimal value 
is achieved with £r = £tot — £t- Therefore, it is sufficient to 
consider £r in the subrange max{0, /x, £tot ~ £t] ^ £r < 
mm{£L, £tot — 7} since it includes the value £r = £tot — £t- 
It then follows from ( l49l l and ( l46l l that the optimal f/j can be 
obtained by solving (l4Tl i. which requires only a simple line 
search over the finite interval. 

B. Proof of Scenario 1 and Scenario 3 

In Scenario 1, where £tot > £l + £t, the total energy 
constraint in (I44cl i is redundant and, thus, £tot can be set as 
infinity. In particular, if fi > £l (and thus /i > £r), we have 
C* = 0, = 7 and £^ = according to Case 2(a). On 
the other hand, if /i < f^, it follows from Case 2(b) that the 
original problem can be expressed as 



max 

£r>0 



s.t 



ai£R)£USR) 



N^al+al-ER + N^al^^ 
max{0, ^Ji\ < £r < Bl, 



■C{Br) 



(52) 



where £p{£r) and C,*{£r) are given by ( |46] | and ( |48] |. Since 
£f{£r) and do not depend on £r in this case, the 

objective in (|52] l increases monotonically with and, thus, 
the optimal value of £jf is given by = f^. This implies that 
both LR and transmitter should transmit with their maximum 
energies in Scenario 3. 

In Scenario 3, where £tot < £l and/or £tot < £t, at least 
one of the individual energy constraints are redundant and, 
thus, can be set as infinity. Therefore, the optimal solution 
can be obtained similarly by solving iAli with the redundant 
constraint(s) (i.e., £l and/or £t) set as infinity. 

Appendix B 
Proof of the Optimal Pilot Matrix Ct 

Consider the optimization of Ct for problem (fl4l . Notice, 
from (fT2l i and (fljl l, that the NMSE at both receivers depend 
only on the value of C^Ct. Let 



be the eigenvalue decomposition of C^Ct, where Uc G 
(^NtxNt jg ^ unitary matrix and D = diag{di, ... ,dNt) is 
a diagonal matrix with di, . . . ,dNf being the eigenvalues of 
C^C. By substituting ^53 into ([Hll and ((B]), we have 

Nt 



and 



NMSEl = f^+a-d, 
1 / 1 



where 



and 



2—1 ^ y 



£F/Nt 



£F/Nt 



Therefore, given £r, £p, and cr^, the optimal can be found 
by solving the following optimization problem 



mm — > [ ^ + a ■ di 

di,...,dMt>oNt'^ \ai 



Nt 

E 

i=l 



(54a) 
(54b) 

d,; = A'^t, > for i = 1, . . . , iVt (54c) 



> 
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where ( I54cb is due to the constraint that Tr(Cf Ct) = A^t. 
By the Karush-Kuhn-Tucker (KKT) conditions, the optimal 
di must satisfy the following conditions 



1 ^\"' nb f I 

+ ^ + bdi 



Nt 

E 

i=i 



+ = 0, i^l,...,Nt (55) 

di = Nt, K > 0, > 0, (56) 



rjidi ^ 0, ?/i > 0, dt>0, i ^ 1, . . . ,Nt, 



(57) 



where k, v and ry^ are the corresponding dual variables of the 
constraints in (I54bb and ( I54cl i. It follows from (l57t that 77^ = 
if di > 0, and therefore, by (l55T l, we can observe that all the 
nonzero di must have the same value. Hence, if there are K 
nonzero d^'s, then, owing to ( l56b , we have di = • • • = dx = 
Nt/K. 

Appendix C 

Derivation of LR's NMSE in Non-Reciprocal Case 

Here we derive the downlink channel estimation perfor- 
mance at the LR. By expressing He; = — AH^ and the 
fact that t = 0, the received signal at LR in 

can be written as 



Cf Ct = U.DUf 



(53) 



Yi3 = CtaHd - AKg AH^.t 

n-d,t 



w. 



(58) 
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where Ct3 = y ^Ct3. Its vector representation is given by 

yL3 = (Ijv^ ® Cts) hd - (Ijv^ ® AK|^ jAhrf,t + W3 (59) 

where = vec(YL3) and W3 = vec(W3). The LMMSE 
estimate of is given by 



where is as defined in ( |36] |. Since A and U are statistically 
independent (321, ( l64l i can be further evaluated as 

(iVt - Nj:)al 



yi,3yi,3 



yL3 



where 



(60) 



(61) 




and 



where the equality follows from the fact that the eigenval- 
ues of the Wishart distributed matrix H^H^ are identically 
distributed ll33l . Substituting (l65T l into ( |62] |. we have an 
approximation of the covariance matrix of 7^3 as 



(62) 



R.V 



Note that the expectation in ( 162b is taken over all the random 
variables including A, Ah^,* and Hd.t, where the last two 
variables actually depend on the value of H„. Using the law of 
iterated expectations, i.e., E,{X} = E{K{X\Y}}, the second 
term of (|62]| can be written as 



lNr.®Wh^Ct3Ci 



{Nt~NL)al(al 



to 



-E 



Ai 



/3/Ai 



IN, 



(66) 



X E{Ahd,tAh^,|Hd,t, H.J(Ijv, ® AK£ p^|H4} (63) NMSE^ = 



Since the NMSE of is 

Tr(E{Ah<iAh^}) 



NtNL 



where we have used the fact that the random matrix A is inde- 
pendent of Ahd.t- Since Ah^ 4 and f are not necessarily in- 
dependent, it is difficult to evaluate E{Ahc;^tAh^j|Hd,t, Hu}. 
To obtain a tractable form, we consider an approximation 
where Ahd.t and t are assumed to be independent, i.e., 
E{Ahd.tAhf,|Hd,i,H.4 « E{Ahd,tAhfJH„}. By m 



Tr 



and the fact that K 
be computed as 

{Nt~NL)al 



equation (l63T l can 



X E- 




^2 T 



H* rrJ 



-to 



'<Jto 



by substituting i6l\ and ( |66] | into (l67b . we obtain an approx- 
imation of NMSEi as shown in ( |68] | (top of the next page). 
To further obtain an approximation for the expectation term in 
(|68] l, we note that, when Nf 3> 1, the distribution of the eigen- 
values of H„H^ can be asymptotically approximated by a 
Gaussian random variable [[34| , that is Ai ~ M{Ntu'^ , Ntu'^). 
Moreover, when Nl is also sufficiently large, cr^ in ( [36b 
is close to zero (i.e., Ai is approximately a constant equal 

to its mean), and thus the term Ea^ 



®In,, (64) 



/3/Ai + l 



can be 



where /? is defined in (|30] l. To further evaluate (|64] i, let us take 
the eigenvalue decomposition of HuH^ as UAU^, where 
U S C^^^^^ is a unitary matrix and A = diag(Ai, . . . , Ajv^) 
is a diagonal matrix containing the unordered eigenvalues of 
H„H^ as the diagonal elements. It is not difficult to show that 
the coefficients of H„ are i.i.d. Gaussian distributed because 
both the uplink channel H„ and the noise matrix W2 are 

i.i.d. Gaussian distributed and that the pilot matrix \J'^^Cl2 
in the reverse training stage (see Section ITlI-Ab is semi-unitary. 
It then follows from results in random matrix theory ll3T| that 
H„H^ has a Wishart distribution with Nt degrees of freedom 
and its mean is given by 



approximated by -^j^ — by the Jensen's inequality. As 
a result, we obtain (l35l l as an approximation of i 



E{H„Hf } = Nt 



-I 



Nr 
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